Rank in Wordlist | Frequency | Word |
---|---|---|
4934 | 898 | 1,5 |
5162 | 853 | 2,5 |
8055 | 518 | 3,5 |
8989 | 460 | 4,5 |
15345 | 243 | 3.306,07 |
19580 | 178 | 5,5 |
21380 | 159 | 1,2 |
21713 | 156 | 7,5 |
22116 | 152 | 1,5% |
22339 | 150 | 1,3 |
Rank in Wordlist | Frequency | Word |
---|---|---|
29839 | 102 | ВКП(б |
36437 | 77 | КП(б)Б |
38919 | 70 | БСДП(Г |
102369 | 16 | A(H1N1 |
129978 | 11 | КП(б |
146212 | 9 | БСДП(Грамада |
159077 | 8 | РКП(б |
171256 | 7 | КПБ(б |
178718 | 7 | нацыі(акрамя |
184503 | 6 | A(H3N2 |
Rank in Wordlist | Frequency | Word |
---|---|---|
16576 | 221 | ўказаны). |
36437 | 77 | КП(б)Б |
62663 | 35 | г.д.). |
67802 | 31 | высылкай). |
79331 | 24 | %) |
110963 | 14 | %). |
116192 | 13 | 2)i |
127036 | 12 | праграма). |
128586 | 11 | 1)i |
141885 | 10 | музыка). |
Rank in Wordlist | Frequency | Word |
---|---|---|
3940 | 1134 | 50% |
4267 | 1048 | 20% |
4338 | 1028 | 30% |
4573 | 974 | 10% |
5713 | 764 | 100% |
5936 | 734 | 40% |
6375 | 677 | 80% |
7093 | 595 | 5% |
7187 | 588 | 90% |
7691 | 545 | 70% |
Rank in Wordlist | Frequency | Word |
---|---|---|
128936 | 11 | S&P |
145522 | 9 | Standard&Poor's |
205113 | 5 | Ernst&Young |
205205 | 5 | Mag&Leu |
232922 | 4 | Click&Roll |
273373 | 3 | B&W |
273686 | 3 | H&M |
273740 | 3 | Ilo&friends |
274159 | 3 | Standard&Poor''s |
274160 | 3 | Standart&Poors's |
Rank in Wordlist | Frequency | Word |
---|---|---|
65837 | 32 | $1 |
79330 | 24 | $2 |
81517 | 23 | $100 |
81518 | 23 | $500 |
83955 | 22 | $2,5 |
102205 | 16 | $10 |
106385 | 15 | $3 |
122049 | 12 | $30 |
122050 | 12 | $45 |
122051 | 12 | $50 |
Rank in Wordlist | Frequency | Word |
---|---|---|
231002 | 4 | %" |
Rank in Wordlist | Frequency | Word |
---|---|---|
1638 | 2693 | сур'ёзны» |
2138 | 2108 | з'яўляецца |
3729 | 1204 | сям'і |
5400 | 814 | аб'ём |
5953 | 733 | прэм'ер-міністра |
5971 | 730 | інтэрв'ю |
6586 | 651 | з'яўляюцца |
7171 | 589 | аб'ектаў |
7432 | 569 | сур'ёзна |
7587 | 555 | прэм'ер |
Rank in Wordlist | Frequency | Word |
---|---|---|
430 | 8928 | а/с |
1853 | 2428 | к/р |
9753 | 420 | с/с |
13673 | 280 | к/p |
14100 | 270 | п/п |
17923 | 200 | п/п. |
33957 | 85 | 2/3 |
37749 | 73 | 1/8 |
41638 | 64 | м/с |
47950 | 52 | Эўропа/Радыё |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $
" ' + * = / _
Depending on the language some of these characters may be allowed within words, other will not. If words with forbidden characters do not have very low frequency there might be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots